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Abstract 

Generalizing the novel clause elimination procedures developed in JT], we introduce explicit 
(CCE), hidden (HCCE), and asymmetric (ACCE) variants of a procedure that eliminates covered 
clauses from CNF formulas. We show that these procedures are more effective in reducing CNF 
formulas than the respective variants of blocked clause elimination, and may hence be interesting as 
new preprocessing/simplification techniques for SAT solving. 

o 

^ ! 1 Introduction 

O 

Simplification techniques applied both before (i.e., in preprocessing) and during search have proven 
integral in enabling efficient conjunctive normal form (CNF) level Boolean satisfiability (SATE solving 
' for real-world application domains. Further, while many SAT solvers rely mainly on Boolean constraint 

propagation (i.e., unit propagation) during search, it is possible to improve solving efficiency by applying 
additional simplification techniques also during search. Noticeably, when scheduling combinations of 
simplification techniques during search, even quite simply ideas can bring additional gains by enabling 
^ ■ further simplifications by other techniques. 

Generalizing the clause elimination procedures developed in [1], in this paper we introduce explicit 
(CCE), hidden (HCCE), and asymmetric (ACCE) variants of a clause elimination procedure that elimi- 
nates what we call covered clauses from CNF formulas. We compare these procedures to the analogous 
^ . variants BCE, HBCE, and ABCE (see Sect. II. lb of blocked clause elimination HHH w.r.t. relative ejfec- 

! tiveness. 

. Definition 1. Assume two clause elimination procedures Si and S2 that take as input an arbitrary CNF 

formula F and each outputs a CNF formula that consists of a subset ofF that is satisfiability-equivalent 
to F. Procedure S\ is at least as effective as 52 if, for any F and any output S\ (F) and S2(F) of Si and 
S2 on input F, respectively, we have that Si (F) C S2(F); S2 is not as effective as Si if there is an F for 
which there are outputs Si(F) and S2{F) of Si and S2, respectively, such that Si(F) C S2{F); and Si is 
more effective than S2 if(i) Si is at least as effective as S2, and (ii) S2 is not as effective as Si. 

This definition of relative effectiveness takes into account non-confluent elimination procedures, i.e., 
procedures that do not generally have a unique fixpoint and that may thus have more than one possible 
output for a given input. In fact, we show that out of the three covered clause elimination procedures, the 
explicit variant CCE is confluent. Extending the relative effectiveness hierarchy presented in HI (see the 
solid arrows in Fig. [D, we show that the variants of covered clause elimination are more effective than 
their counterparts based on blocked clauses (see the dashed arrows in Fig. [[]). In this sense, the elimi- 
nation procedures introduced in this paper are proper generalizations of the techniques analyzed in HI. 
This is interesting since it has been recently shown in |2] that already BCE is surprisingly effective, 
as it can — purely on the CNF level — implicitly perform a combination of structure-based circuit-level 
techniques, including the polarity-based Plaisted-Greenbaum CNF encoding and difference circuit sim- 
plifications. Here, the most effective technique is the asymmetric variant of covered clause elimination. 



*Supported by Dutch Organization for Scientific Research under grant 617.023.61 1. 
^Supported by Academy of Finland under grant #132812. 
1 We assume that the reader is familiar with basic concepts related to CNF satisfiability. When convenient we view a clause 
as a set of literals and a CNF formula as a set of clauses. 
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Figure 1: Relative effectiveness hierarchy of clause elimination procedures. An edge from X to Y means 
that X is more effective than Y. A missing edge from X to Y means that X is not as effective as Y. 
Transitive edges are missing from the figure for clarity. The dashed arrows present results of this paper. 

1.1 Variants of Blocked Clause Elimination 

The resolution rule states that, given two clauses C\ = {l,ai, . . . ,a n } andC2 = {T,b2, ■ ■ ■ ,b m }, the implied 
clause C = {a\, . . . ,a„,bi, . . . ,b m }, called the resolvent of C\ and Ci, can be inferred by resolving on the 
literal /, and write C = C\ ®/ C%. 

We compare the clause elimination procedures based on covered clauses to the following proce- 
dures [1] that eliminate blocked clauses [3 ]. Notice that out of these three, only BCE is confluent HI. 



BCE Given a CNF formula F, a clause C and a literal I G C, the literal / blocks C w.r.t. F if (i) for each 
clause C' G F with I G C, C ®i C is a tautology, or (ii) I G C, i.e., C is itself a tautolog>H. Given a CNF 
formula F , a clause C is blocked w.r.t. F if there is a literal that blocks C w.r.t. F. Removal of blocked 
clauses preserves satisfiability 0. For a CNF formula F, blocked clause elimination (BCE) repeats the 
following until fixpoint: if there is a blocked clause C G F w.r.t. F, let F := F\ {C}. The CNF formula 
resulting from applying BCE on F is denoted by BCE(F). 



HBCE Given a CNF formula F, we denote by F2 the set of binary clauses contained in F . For a 
given clause C G F, we denote by (hidden literal addition) HLA(F,C) the unique clause resulting from 
repeating the following clause extension steps until fixpoint: if there is a literal Iq G C such that there is 
a clause (Z V /) G F 2 \ {C} for some literal /, let C := C U {I}. For a CNF formula F, a clause C G F 
is called hidden blocked if HLA(F,C) is blocked w.r.t. F. Hidden blocked clause elimination (HBCE) 
repeats the following until fixpoint: if there is a hidden blocked clause C G F, remove C from F . 



ABCE For a clause C and a CNF formula F, (asymmetric literal addition) ALA(F,C) denotes the 
unique clause resulting from repeating the following until fixpoint: if l\ , . . . , 1% G C and there is a clause 
(h V • • • V k V /) G F \ {C} for some literal /, let C := C U {/}. A clause C G F is called asymmetric 
blocked if ALA(F,C) is blocked w.r.t. F. Asymmetric blocked clause elimination (ABCE) repeats the 
following until fixpoint: if there is an asymmetric blocked clause C G F, let F := F \ {C}. 



2 Covered Clause Elimination Procedures 

Given a CNF formula F, a clause C, and a literal / G C, the set of resolution candidates of C w.r.t. / 
is RC(F,C,Z) := {C \ C G Fj and C (g>/ C" is not a tautology}. Notice that every clause in RC(F,C,Z) 
contains the literal /. If RC(F,C,Z) = 0, then C is blocked w.r.t. F. The literals apart from / which occur 
in all clauses of RC(F,C,/) form the resolution intersection RI(F,C, I) of / and C w.r.t. F, defined as 

RI(F,C,/):= (nRC(F,C,/))\{/}. 

2 Here I e C is included to handle the special case that for any tautological binary clause (Z V/), both / and I block the 
clause. Even without this addition, every non-binary tautological clause contains at least one literal that blocks the clause. 
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Given a CNF formula F, a clause C € F, and a literal / € C, we say that / covers the literals in RI(F, C, /) 
(w.r.t. F and C). A literal I' is covered by / € C if /' G RI(F,C, /). A literal / € C is covering w.r.t. F and 
C if / covers at least one literal, i.e., RI(F,C,Z) ^ 0. 

Lemma 1. For any CNF formula F, clause C € F, ant/ literal I 6 C, if /joWs replacing C by CD 
Rl(F,C,l) in F preserves satisfiability. 

Proof. For any literal / e C it holds that VE(F, /) = VE((F \ j_C}) U {C U RI(F, C, /)}, /), where VE(F, I) 
denotes the CNF formula resulting from variable eliminatingj the variable of the literal / from F. □ 

For a given clause C in a CNF formula F, we denote by (covered literal addition) CLA(F,C) the clause 
resulting from repeating the following until nxpoint: if there is a literal I EC such that RI(F,C,/) \C ^ 0, 
letC:=CURI(F,C,/). 

Lemma 2. Replacing a clause C € F by CLA (F,C) preserves satisfiability. 

Proof. The clause CLA (F,C) is obtained by iteratively applying Lemma[T]on clause C. □ 

Lemma 3. Assume two clauses C,Dwith I € C C D and two sets of clauses F, G with F C G. Further as- 
sume that D is not blocked w.r.t. F and hence C is not blocked w.r.t. G. Then RC(G,C,l) 3 RC(F,D,Z) 7^ 
and hence RI(G, C, I) C RI(F, D, /). 

Proof. Monotonicity of RC w.r.t. its first argument and anti-monotonicity w.r.t. its second argument 
follows directly from its definition. For RI, note that intersection is anti-monotonic for non-empty sets 
of sets. □ 

Theorem 1. Given a CNF formula F and a clause C € F, CLA(F,C) is blocked or uniquely defined. 

Proof. Assume C is not blocked w.r.t. F and contains two literals h,h, which cover the literals L' t = 
RI(F,C,/j) respectively. Consider the clauses C\ = CUL\ and C2 = CUL' 2 . Now assume that both of 
Ci,C2 are not blocked w.r.t. F. Then all clauses D € RC(F,Ci,/2) Q RC(F,C,/2) contain all literals in L' 2 . 
SinceCi is not blocked and thus RC(F,Ci,/2) is not empty, we obtain L' 2 C RI(F,Ci,/2)- The case where 
the indices are exchanged (i.e., L\ C RI(F,C2,/i)) is symmetric. Thus as long clauses do not become 
blocked, covered literals can be added independently. The case that both of C\,C 2 are blocked is trivial. 

What remains (by symmetry) is the case that C2 is blocked but C\ is not. Again, as above, we get 
L' 2 CRI(F,Ci,/ 2 ). ForCj = Ci U RI(F, C x , l 2 ) we have C[ = CUL[ URI(F,Ci,/ 2 ) DL[U(CUL 2 ) D C 2 
which is also blocked. This generalizes to the following observation: For any non-deterministic choice 
of adding covered literals to C, the literal I2 remains covering. Further, if in this process the clause did 
not become blocked, it will eventually become blocked if the covered literals of 1% are added. □ 

To illustrate the effect of adding covered literals on logical equivalence^, consider the formula 
Fcla = (aVoVc) A(aVbVd) A(flVcVJ) A(aVWc) A(flViVrf) A(aVcVrf). 

Notice that RI(F CL a, (flV&Vc),fc) = {d} and RI(F CL a, {a Vo Vc),c) = {d}. Therefore, depending 
on the order of addition, CLA (Fcla , (a V o V c) ) is either (aV bV c V d) when starting with covering literal 
b or (a V b V c V d) when starting with covering literal c. In both cases CLA(F<xa , {a V b V c) ) is blocked. 
After replacing (a V b V c) by (a V b V c V d) the truth assignment % with %[a) = z(b) = z(c) = false and 
z(d) =true satisfies the new formula, while falsifying (aVoVc) G Fcla- hi fact, Fcla witnesses the fact 
that none of the clause elimination procedures introduced next preserve logical equivalence in general. 

3 More formally, VE(F, /) = (F/ ® Fr) U (F \ (F; U Ff)), where F; and Fj consist of the clauses in F that contain / and I, 
respectively, and F/ ® Fj = {C ®/ C' | C e F; , C' £ F^, and C g5; C' is not a tautology}. 

4 In this context, two formulas F and F' are logically equivalent if they have exactly the same set of satisfying assignments 
when restricting these assignments to the variables appearing in both F and F'. 
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2.1 Covered Clause Elimination 



Definition 2. Given a CNF formula F, a clause C € F is covered ifCLA(F,C) is blocked w.r.t. F. 
Lemma 4. Removal of an arbitrary covered clause preserves satisfiability. 

Proof. C can be replaced by CLA(F, C) (Lemma[2]), and C can be removed as CLA(F, C) is blocked. □ 

For a given formula F, covered clause elimination (CCE) repeats the following until fixpoint: if there 
is a covered clause C € F, remove C from F . The resulting unique formula is denoted by CCE(F). 
Confluence of CCE follows from the following lemma. 

Lemma 5. The following holds for any CNF formula F, clause C £ F, and set of clauses S C F such 
that C $lS. IfCis covered w.r.t. F, then C is covered w.r.t. F\S. 

Proof. Let CLA(F,C) = C k , where C := C, and C i+l := C, URI(F,Q, /,) for each i = 0..k - 1 and /,• € Q. 
Now define Do := C and, for each i = 0..A; — 1, D, + i := if is blocked w.r.t. F\S and D; + i := £> ( U 
RI (F \ S, Di , li ) otherwise. Using Lemma[3l one can show by induction that for each i we have either (i) D ( 
is blocked w.r.t. F\ S, or (ii) RI(F\5,A,/i) 5 RI(F,Q,/,). If (i) holds for some i, then CLA(F\S,C) is 
blocked w.r.t. F\C. If D t is not blocked w.r.t. F \ S for any /, then CLA(F \S,C)D CLA(F, C) . □ 

Theorem 2. CCE confluent. 

Theorem 3. CCE is more effective than BCE 

Proof. CCE is at least as effective as BCE follows from the fact that C C CLA(C): if C is blocked, so 
is CLA(C). Moreover, in i^cLA no clause is blocked. However, all clauses are covered. Hence BCE will 
not remove a single clause, while CCE removes all of them. □ 

2.2 Hidden Covered Clause Elimination 

For a given CNF formula F, a clause C G F is hidden covered if the clause resulting from repeating 
1. C := CLA(F,C); 2. C := HLA(F,C) until fixpoint is blocked w.r.t. F. Hidden covered clause elimi- 
nation (HCCE) repeats the following until fixpoint: if there is a hidden covered clause C in F, remove C 
from F. 

Lemma 6. Removal of an arbitrary hidden covered clause preserves satisfiability. 

Proof. Follows from the facts that (i) F is satisfiability equivalent to (F \ {C}) U {CLA(F,C)}; (ii) F is 
satisfiability equivalent to (F \ {C}) U {HLA(/ 7 ,C)}; and (iii) BCE preserves satisfiability. □ 

Theorem 4. HCCE is more effective than CCE. 

Proof. HCCE is at least as effective as CCE follows from the fact that C C HLA(F,C): if C is covered, 
so is HLA(F,C). Moreover, consider the formula 

^HCCE = (a V b) A (a V c) A (a V d) A (a V e) A (b V c) A (b V d) A (b V e) A (c V d) A (c V e) A (d V e) . 

In Fhcce no clause is covered. However, all clauses are hidden covered. Hence CCE will not remove a 
single clause, while HCCE removes all of them. □ 

By replacing CCE and BCE by HCCE and HBCE in the proof of Theorem [3] we have the following. 

Theorem 5. HCCE is more effective than HBCE. 
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2.3 Asymmetric Covered Clause Elimination 

For a given CNF formula F, a clause C € F is called asymmetric covered if the clause resulting from 
repeating 1. C := CLA(F,C); 2. C := ALA(F,C) until fixpoint is blocked w.r.t. F. Asymmetric covered 
clause elimination (ACCE) repeats the following until fixpoint: if there is a hidden covered clause C in 
F, remove C from F . 

Lemma 7. Removal of an arbitrary asymmetric covered clause preserves satisfiability. 

Proof. Follows from the facts that (i) F is satisfiability equivalent to (F \ {C}) U {CLA (_F,C)}; (ii) F is 
satisfiability equivalent to (F \ {C}) U {ALA(F,C)}; and (iii) BCE preserves satisfiability. □ 

Theorem 6. ACCE is more effective than ( i) ABCE, and ( ii) HCCE 

Proof, (i) By replacing CCE and BCE by ACCE and ABCE in the proof of Theorem [3] 

(ii) ACCE is at least as effective as HCCE follows from the fact that HLA(F,C) C ALA(F,C): if 
HLA(,F,C) is covered, so is ALA(F,C). Moreover, consider the formula 

^ACCE = (flVWc) A(aVfeVc) A(a\/bVc) A(a\/b\/c) A(aVfcVc) A(aVWc) A 
(a V b V c) A (a V b V c) A (a V b V d) A {a V b V d) A {a V b V d) A (a V £ V d) 

In Facce no clause is hidden covered. However, ACCE can remove (aVbV c) and (aVfcVc). □ 

3 Discussion and Conclusions 

Our current preliminary implementation of CCE requires on average twice the computational cost of 
BCE on the 2009 SAT Competition application benchmark set when applied until fixpoint. This implies 
that CCE can be made quite fast in practice. Regarding the practical effectiveness of CCE, on about 
half of the instances, CCE(F) is approximately the same size as BCE(F) (the difference is less than 10 
clauses). However, on the other half the additional reduction is about 5% compared to BCE; for the best 
case, we observed one instance for which the additional reduction was as high as 40%. 

As further work on this subject, we will focus on studying the effectiveness of CCE further in prac- 
tice, and also possibilities of implementing HCCE and ACCE. Here it is important to notice that, even 
when a specific elimination technique is too costly for practical purposes to be run until fixpoint, such 
a technique may be of practical use in a restricted form, i.e., by only applying it on long clauses or for 
a restricted time. Also, we will measure the effect of applying these elimination techniques on solving 
interesting benchmark formulas. On the more foundational side, we will study how to reconstruct solu- 
tions for a CNF formula F from solutions to any CCE(F), HCCE(F) and ACCE(F); this is important 
for practical applications since CCE, HCCE, and ACCE do not preserve logical equivalence. 
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